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We construct a reference benchmark set for atomic and molecular random-phase-approximation 
(RPA) correlation energies in a density functional theory (DFT) framework at the complete basis 
set limit. This set is used to evaluate the accuracy of some popular extrapolation schemes for 
RPA all-electron molecular calculations. The results indicate that for absolute energies accurate 
results, clearly outperforming raw data, are achievable with two-point extrapolation schemes based 
on quintuple- and sextuple-zeta basis sets. Moreover, we show that results in good agreement 
with the benchmark can also be also obtained by using a semiempirical extrapolation procedure 
based on quadruple- and quintuple-zeta basis sets. Finally, we analyze the performance of different 
extrapolation schemes for atomization energies. 
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INTRODUCTION 

The random phase approximation (RPA) for electron 
correlation was originally developed in the context of the 
many-body perturbation treatment of the uniform elec- 
tron gas [l|, |2| and later reformulated in the framework 
of density functional theory (DFT) via the adiabatic- 
connection fluctuation-dissipation theorem Q. In the 
last years, RPA correlation witnessed an increasing in- 
terest in quantum chemistry [3-E7|. due to its ability 
to describe with good accuracy a rather large variety of 
reaction and interaction energies. In this context RPA 
is generally employed as a post Kohn-Sham (KS) ap- 
proach where the exact exchange (EXX) and the corre- 
lation energies are evaluated using the eigenvalues and 
eigenfunctions from a KS calculation with conventional 
exchange-correlation (XC) functionals. EXX KS orbitals 



30[ have been also employed via the optimized effec- 
tive potential (OEP) method, while for a fully variational 
app roach self-consistent EXX-RPA orbitals are required 
3lM38l| . In all cases practical applications of the method 
are often hindered by the the very slow convergence of 
the RPA-correlation energy with the basis set dimension 



39I 40j . which is related to the electron cusp problem 
J], in analogy with wavefunction methods. The cusp 
problem is strongly reduced for range-separated RPA 
[llMlSl . klL 1421 , whereas for full-range RPA approaches 



extrapolation to the complete basis set (CBS) must be 
employed. 

Basis set extrapolation to the CBS limit has been lon- 
gley studied for post Hartree-Fock correlated methods, 
with the aim of reducing as much as possible the ba- 
sis set truncation error without resorting to the brute 
force employment of extremely large basis sets. Numeri- 
cal work [43 4^ on various correlation treatments of the 
helium atom has shown that the basis set incomplete- 
ness error is approximately proportional to the inverse 



third power of the maximum angular momentum, pro- 
vided that all angular momenta are radially saturated. 
However, the ideal conditions of fully saturated angular 
momenta in an atomic system are rather distant from 
those encountered in real applications on molecules with 
finite basis sets. Nevertheless, the correlation energy has 
been empirically observed to converge regularly enough 
to allow the development of various accurate extrapola- 
tion schemes which have been assessed in several studies 
in literature 50-54]. On the other hand, no such study 
has been performed for RPA correlation energies. 



In this paper we aim at filling this gap and shed light 
on the usefulness and limits of different extrapolation 
schemes for RPA correlation energies. In this respect the 
goal of the present work is to: (1) create a set of bench- 
mark CBS RPA absolute correlation energies to be used 
as reference for future assessments; (2) validate some ex- 
trapolation schemes, usually applied in the context of 
coupled cluster theory, for the calculation of CBS RPA 
absolute energies, including semiempirical techniques to 
approach the extrapolation from relatively small basis 
sets; (3) test the convergence behavior of different ba- 
sis sets and the reliability of the extrapolations schemes 
for atomization energies, which exploit a significant er- 
ror cancellation effect. Absolute correlation energies will 
be considered, as in many corresponding studies of cor- 
related methods, because these converge regularly with 
the basis set and provide an ideal quantity for the study 
of extrapolation schemes. In this paper we focus on light 
elements because they provide a simpler convergence of 
the correlation energy and can be more easily investi- 
gated with large basis sets. 
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COMPUTATIONAL DETAILS 

RPA calculations were performed using the eigenval- 
ues and the orbitals from DFT calculations based on 
the Perdew-Burke-Ernzerhof (PBE) exchange-correlation 
functional 55]. In all calculations Dunning's correlation- 
consistent basis sets l^-I^I (cc-pVnZ, n = _D,5,6,7) 
aug mented with core and core-valence basis functions 

60l462| were employed. These are hereafter denoted as 
VnZcv with n=4, 5, 6, and 7. Augmented diffuse basis 
functions were not considered in the present work since 
they were shown to slow down RPA basis set convergence 
without bringing substantial benefits in most cases (40| . 
Also, no basis set superposition error correction was con- 
sidered for atomization energies. For molecules experi- 
mental geometries were considered G^-lll- AH calcula- 
tions were performed with the TURBOMOLE program 

ackage [77[ using the implementation described in Ref. 



REFERENCE DATA 

The construction of a set of benchmark CBS-limit RPA 
energies is a fundamental step before any possible assess- 
ment work on extrapolation schemes. Unfortunately, in 
the case of RPA correlation energies, this is a difficult 
task. In fact, unlike for MP2 or coupled cluster meth- 
ods, no explicitly correlated Q RPA data exists that 
can serve as an accurate reference. At the same time, a 
brute force strategy, based on the use of very large ba- 
sis sets, is hampered by the extremely slow convergence 
rate of the RPA calculations. Thus, in order to approach 
the true CBS limit as close as possible and provide accu- 
rate reference CBS energies, some form of extrapolation 
is necessary. However, this introduces inevitably an un- 
desirable degree of uncertainty in the data that can be 
anyway reasonably reduced by an appropriate control of 
the possible sources of errors. 

In this work we construct our set of reference RPA 
correlation energies based on two leading criteria: (1) we 
employ as a basis for the extrapolation the best results 
at our disposal (i.e. those from V5Zcv to V7Zcv cal- 
culations), in order to recover as much as possible the 
"theoretical" asymptotic converge of the correlation en- 
ergy that underlies all extrapolation formulas and thus 
reduce the extrapolation error. For this reason we pre- 
fer to exclude data from V4Zcv calculations (or lower) 
that may increase the computational noise due to the 
incompleteness of the basis set. (2) We fix our best esti- 
mate of the RPA correlation energy by averaging over the 
data obtained from several flexible three-parameters ex- 
trapolation formulas, in order to avoid the possible bias 
characteristic of a specific extrapolation scheme. A sim- 
ilar procedure was recently applied for the construction 
of a benchmark set of CCSD(T) CBS energies In 



addition, we consider in detail the evolution of different 
extrapolated energies with dimension of the basis sets to 
fix reasonable bounds to our uncertainty. 

In more detail, we consider the popular extrapolation 
formula 51, 79 



E„ 



A 



[n + d)3 



and its variant proposed by Martin [8l| 



En — Ea 



A 



{n + d)^ 



(1) 



(2) 



where A and d are parameters and Eoo is the CBS limit of 
the correlation energy. The free parameter d accounts for 
the incompleteness of the basis set angular-momentum 
saturation [52] and also partially introduces effectively 
higher-order contributions to the asymptotic correlation 
expansion, as is readily recognized by considering the al- 
ternative form for Eq. ([1]) 



En — En 



A 

w 



i=l 



Ad' 



(3) 



In addition, we use the extrapolation formula proposed 
by Bakowies 52, H^j 



En = En 



A 



(4) 



with A and a adjustable parameters. Again, a pro- 
vides an effective correction for deviations from the lead- 
ing asymptotic behavior cx [H^l, by partially adding 
high-order terms to the expansion. 

The RPA correlation energies of several atoms and 
molecules as resulting from V5Zcv, V6Zcv, V7Zcv calcu- 
lations and the extrapolations with formulas ©-(111) are 
reported in Tab. [H together with our best estimate for 
the CBS limit RPA correlation energy. All the extrapo- 
lation methods yield very close results, with differences 
< 1 mHa, justifying our assumption of taking as best 
estimate their average value. 

We define in addition, in the last column of Tab HI the 
error 5 associated with of our benchmark energies in the 
following way. First, we note that numerical evidence 
shows that for the two limiting cases of an inverse cu- 
bic extrapolation (i.e. using Eq. ([1]) with d — Q) and of 
an exponential extrapolation, the exact CBS RPA cor- 
relation energy is approached from below and above, re- 
spectively. This is shown for some representative cases 
in Fig. m where the results of the two extrapolations 
with increasingly large basis sets are reported, and can 
be rationalized as follows: on one hand the exponential 
extrapolation is well known to underestimate the CBS 
limit [iOl because of the too fast rate of variation with 
the basis set (exponential rather than with a power law), 
on the other hand the simple inverse cubic extrapolation 
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TABLE I: RPA correlation energies of several atoms and molecules for different basis sets and extrapolation schemes. The last 
two columns report the best estimate (average of the extrapolated values) and the estimated uncertainty S. At the bottom of 
the table we report for the raw data from different basis sets the mean error (ME), mean absolute error (MAE), mean absolute 
relative error (MARE), maximum absolute deviation (MAD), and standard deviation with respect to the best estimate. All 
results are in mHa. 



System 


V5Zcv 


V6Zcv 


V7Zcv 


V7ZCV+ 


l/{n + df 




1/n" 


Best est. 


5 


H 


-20.3 


-20.6 


-20.7 


-20.9 


-21.0 


-21.0 


-21.0 


-21.0 


0.1 


C 


-284.9 


-288.5 


-290.1 


-292.0 


-292.6 


-293.0 


-292.9 


-292.8 


0.7 


N 


-327.8 


-332.8 


-334.9 


-336.8 


-337.2 


-337.5 


-337.1 


-337.3 


0.6 


O 


-416.5 


-423.4 


-426.7 


-430.4 


-431.5 


-432.0 


-431.9 


-431.8 


0.9 


F 


-504.1 


-513.1 


-517.5 


-522.0 


-523.9 


-524.6 


-524.4 


-524.3 


1.3 


Ne 


-579.2 


-590.5 


-595.6 


-600.5 


-601.9 


-602.6 


-602.0 


-602.2 


1.3 


H2 


-80.0 


-80.5 


-80.7 




-81.2 


-81.1 


-81.2 


-81.2 


0.2 


NH 


-399.3 


-404.8 


-407.2 




-410.4 


-410.7 


-410.4 


-410.5 


0.6 


NH2 


-468.1 


-474.1 


-476.8 




-480.4 


-480.9 


-480.6 


-480.6 


0.7 


CH4 


-491.4 


-496.3 


-498.6 




-501.4 


-501.7 


-501.4 


-501.5 


0.6 


NH3 


-533.4 


-539.3 


-541.8 




-544.5 


-544.9 


-544.5 


-544.6 


0.7 


H2O 


-565.7 


-573.8 


-577.4 




-581.7 


-582.3 


-581.8 


-581.9 


0.9 


FH 


-582.1 


-591.9 


-596.3 




-602.0 


-602.7 


-602.2 


-602.3 


1.1 


C2H2 


-756.5 


-765.3 


-769.4 




-774.7 


-775.3 


-774.9 


-775.0 


1.1 


CN 


-774.5 


-783.9 


-787.8 




-792.3 


-792.9 


-792.3 


-792.5 


1.0 


HCN 


-799.8 


-809.6 


-813.8 




-818.7 


-819.3 


-818.7 


-818.9 


1.0 


CO 


-821.6 


-832.8 


-837.5 




-842.8 


-843.5 


-842.7 


-843.0 


1.2 


N2 


-833.6 


-844.6 


-849.3 




-855.1 


-855.8 


-855.2 


-855.4 


1.2 


C2H4 


-833.4 


-842.5 


-846.7 




-852.3 


-852.9 


-852.5 


-852.6 


1.1 


HCO 


-865.9 


-877.5 


-882.5 




-888.4 


-889.1 


-888.4 


-888.6 


1.2 


H2CO 


-909.8 


-921.6 


-926.7 




-932.7 


-933.5 


-932.7 


-933.0 


1.2 


O2 


-973.1 


-987.6 


-993.9 




-1001.5 


-1002.4 


-1001.6 


-1001.8 


1.5 


H3COH 


-984.0 


-996.6 


-1002.1 




-1008.7 


-1009.5 


-1008.8 


-1009.0 


1.3 


HOOH 


-1077.2 


-1092.6 


-1099.4 




-1107.6 


-1108.6 


-1107.7 


-1108.0 


1.6 


F2 


-1125.2 


-1143.9 


-1152.3 




-1163.0 


-1164.3 


-1163.4 


-1163.6 


2.4 


ME 


17.9 


9.0 


5.1 




0.2 


-0.4 


0.1 






MAE 


17.9 


9.0 


5.1 




0.2 


0.4 


0.1 






MARE 


2.72% 


1.40% 


0.80% 




0.04% 


0.05% 


0.02% 






MAD 


38.4 


19.7 


11.3 




0.5 


0.7 


0.3 






Std. Dev. 


8.7 


4.4 


2.5 




0.1 


0.2 


0.1 







is likely to overestimate the CBS limit because the 1/?!'^ 
behavior is strictly valid only asymptotically, while for 
real operative conditions a faster rate of variation with 
the basis set shall be expected (hence the d parameter 
in Eq. ([1])). The best extrapolated values obtained with 
this two procedures can thus be considered as upper and 
lower bounds for the true CBS RPA correlation energy. 
Then, after an analysis of the trends of Fig. [1] we as- 
sume as an estimation of the error half of the maximum 
difference between our best estimate and any of the two 
bounds for the energy. 

The reference energies of Tab. Uhave thus an estimated 
accuracy of about 1 mHa or less, except for the last four 



entries of the table, which display slightly larger errors, 
with a maximum of 2.4 for F2. We note however, that 
these systems also have the largest absolute correlation 
energies. In fact, the relative errors for all systems are 
very low ranging between 0.11% and 0.25%, with a mean 
absolute relative error of 0.16%. 

To further validate our results we considered for the 
atoms also RPA calculations with an extremely large ba- 
sis set (VTZcv-l-), constructed by adding to the V7Zcv 
basis set centered on the atom additional uncontracted 
basis functions (up to angular momentum g) centered at 
six points located on the Cartesian axes at ±1 Bohr of 
distance from the atom, for a total of 1113 basis func- 
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FIG. 1: Extrapolated RPA correlation energies (mHa) of the 
O and Ne atoms and the C2H2 molecule as a function of 
different combinations of basis sets. The label nm-ext denotes 
an extrapolation based on Eq. ([T]) with d — and using 
VnZcv and VmZcv basis sets; the label nmfc-exp denotes an 
exponential extrapolation using VnZcv, VmZcv, and VfcZcv 
basis sets. The dashed lines indicate our best estimate for the 
CBS RPA energy. The a;-axis uses an inverse cubic scale. 



tions per atom (723 for the H atom). The resulting basis 
set allows to describe effectively very high angular mo- 
mentum contributions, thanks to the composition of mo- 
menta from functions at different centers, without the 
need to effectively include basis functions with such high 
values of angular momentum into our basis set and par- 
tially avoiding problems related to the linear dependence 
of basis functions. The results of the V7Zcv+ calcula- 
tions, reported in Tab. HI show that indeed by using a 
very large basis set the estimated RPA CBS energy are 
approached well, with deviations that are always close or 
lower than the expected uncertainty of the reference val- 
ues. This result provides a good support for the accuracy 
of our estimation of the CBS limit of the RPA correlation 
as well as for the the corresponding errors. 

The procedure outlined in this section, and whose re- 
sults are reported in Tab. HI allows therefore to produce 
an accurate set of CBS RPA correlation energies, improv- 
ing of several mHa (about 5 to 10) with respect to V7Zcv 
results (see bottom part of Tab. Even more important 
improvements, up to 20 mHa, are found with respect to 
V6Zcv calculations. The proposed set, despite its hmited 
dimension, provides in addition a representative selection 
of molecules constituted from first-row atoms, including 
doublet and triplet ground states. Thus it offers a valu- 
able tool for the benchmarking of RPA correlation ener- 
gies. 



TWO-POINT EXTRAPOLATIONS 

In this section we seek for globally optimized parame- 
ters d and a to be used in Eq. (P), ([2|), and (|4]), capable of 
yielding accurate RPA energies with respect to reference 
values. The use of global parameters in the extrapola- 
tion formulas has in fact several advantages in practical 
applications: (1) it allows to compute the CBS energy 
through two-point extrapolation formulas instead of us- 
ing a fitting procedure; (2) with a careful optimization 
of the global parameters it allows to effectively include 
(on average) high-order effects into the extrapolation, so 
that smaller basis sets can be used; (3) it produces more 
systematic errors, favoring error cancellation in many ap- 
plications. The search for global parameters is also moti- 
vated in the present study by the observation that in the 
individual fittings of RPA correlation energies performed 
in the previous section we observed that for each formula 
all the optimal values of c? or a were contained in a rather 
narrow range. Thus, it is conceivable that "average" val- 
ues of the parameters may exist which produce accurate 
estimates of the CBS limit. 

For our purpose we consider the two-point extrapola- 
tion formulas 

^ _ En{n + dppt)^ - Emim + rfppt)^ 
°° {n + dopt)^ - {m + doptY 

p _ En{n + doptY - Ernjm + dopt)^ 

{n + dopty-{m + doptr ^ ' 

which are derived directly from Eqs. (H)), ([1]), and (|3]), re- 
spectively, considering two basis sets of cardinal number 
n and m. 

The optimization of the global parameters dopt and 
aopt was performed using either n = 7, m = 6 or n = 6, 
TO = 5, and minimizing the mean absolute error with re- 
spect to the reference data of Tab. H] The use of a smaller 
basis set (n = 5, m = 4) resulted instead into a failure of 
the minimization procedure, because no reasonable com- 
promise value was found for dopt and Oopt), reflecting the 
difficulty of the V4Zcv basis set to correctly describe the 
RPA correlation. 

The values of the optimized parameters as well the cor- 
responding results for the test systems and global statis- 
tics are reported in Tab. [Hi We see that all methods 
yield similar results in good agreement with the refer- 
ence values. The differences between the various extrap- 
olation formulas as well as between data obtained using 
a 67-extrapolation and a 56-extrapolation are negligible 
(< 0.4 mHa). This indicates that the optimized global 
parameters can in fact describe very well the correla- 
tion convergence behavior, effectively taking into account 
higher-order contributions. Concerning general trends, 
we observe that 56-extrapolations appear to slightly un- 



5 



TABLE II: RPA correlation energies of test atoms and molecules for different basis sets and global extrapolation schemes. 
The last lines report the mean error (ME), the mean absolute error (MAE), the mean absolute relative error (MARE), the 
maximum absolute deviation (MAD) from reference, and the standard deviation of each data set. All results are in mHa. 



Optimized param. 


67-extrapolation 




56- 


extrapolation 




Ref. 


1/in + df 
dopt —1.33 


l/{n + df 
dopt = 0.37 




!/(•„ + d)3 

/ V y 

r/„„+ — — 1 17 


l/(n + d)^ 


l/rf 
cy^^+ = 3 82 


H 


-20.9 


-20.9 


-20.9 


-20.8 


-20.8 


-20.8 


-21.0±0.1 


c 


-292.1 


-292.1 


-292.1 


-292.0 


-292.0 


-292.0 


-292 8±0 7 


N 


-337.5 


-337.5 


-337.5 


-337.8 


-337.8 


-337.8 


-337 3±0 6 


o 


-431.0 


-431.0 


-431.0 


-430.2 


-430.2 


-430.2 


-431 8±0 9 


F 


-523.1 


-523.1 


-523.1 


-522.1 


-522.1 


-522.1 


-524.3±1.3 


Ne 


-602.0 


-602.0 


-602.0 


-601.8 


-601.8 


-601.8 


-602.2±1.3 


H9 


-81.1 


-81.1 


-81.1 


-81.0 


-81.0 


-81.0 


-81.2±0.2 


NH 


-410.4 


-410.4 


-410.4 


-410.2 


-410.2 


-410.2 


-410 5±0 6 


NH2 


-480.3 


-480.3 


-480.3 


-480.0 


-480.0 


-480.0 


-480 6±0 7 


CH4 


-501.4 


-501.4 


-501.4 


-501.3 


-501.3 


-501.3 


-501.5±0.6 


NH3 


-544.9 


-544.9 


-544.9 


-545.3 


-545.3 


-545.3 


-544 6±0 7 


H2O 


-581.9 


-581.9 


-581.9 


-581.8 


-581.8 


-581.8 


-581 9±0 9 


FH 


-602.0 


-602.0 


-602.0 


-601.7 


-601.7 


-601.7 


-602.3±1.1 


C2H2 


-774.5 


-774.5 


-774.5 


-774.0 


-774.0 


-774.0 


-775. Oil. 1 


CN 


-792.8 


-792.8 


-792.8 


-793.2 


-793.2 


-793.2 


-792 5±1 


HCN 


-819.2 


-819.1 


-819.2 


-819.4 


-819.4 


-819.4 


-818.9±1.0 


CO 


-843.4 


-843.4 


-843.4 


-843.9 


-843.8 


-843.9 


-843. Oil. 2 


N2 


-855.4 


-855.4 


-855.4 


-855.5 


-855.4 


-855.5 


-855.4±1.2 


C2H4 


-852.1 


-852.0 


-852.0 


-851.5 


-851.5 


-851.5 


-852.6±1.1 


HCO 


-888.8 


-888.8 


-888.8 


-889.0 


-889.0 


-889.0 


-888 6±1 2 


H2CO 


-933.1 


-933.1 


-933.1 


-933.3 


-933.3 


-933.3 


-933 0+1 2 


O9 


-1001.9 


-1001.9 


-1001.9 


-1002.0 


-1001.9 


-1001.9 


-1001 8±1 5 


H3COH 


-1009.0 


-1009.0 


-1009.0 


-1009.1 


-1009.0 


-1009.0 


-1009 Oil 3 


HOOH 


-1108.0 


-1107.9 


-1108.0 


-1108.0 


-1107.9 


-1108.0 


-1108.0±1.6 


F2 


-1163.0 


-1163.0 


-1163.0 


-1162.4 


-1162.3 


-1162.4 


-1163.6±2.4 


ME 


0.1 


0.2 


0.2 


0.2 


0.3 


0.2 




MAE 


0.3 


0.3 


0.3 


0.6 


0.6 


0.6 




MARE 


0.07% 


0.07% 


0.07% 


0.13% 


0.13% 


0.13% 




MAD 


1.1 


1.2 


1.1 


2.2 


2.2 


2.2 




Std. Dev. 


0.4 


0.4 


0.4 


0.8 


0.8 


0.8 





derestimate in general the RPA correlation energy, while 
this effect is reduced for 67-extrapolations. We stress 
however, that because the differences between different 
methods and with reference data is well below our esti- 
mated accuracy for the benchmark set, no quantitative 
conclusions can be drawn from the results of Tab. |lTl and 
to practical purposes all the methods must be considered 
completely equivalent. 

As a final note it is instructive to highlight that the 
global optimization of the parameters provides also valu- 
able indications on the convergence behavior of different 
formulas. Thus, the values obtained from the optimiza- 



tion procedure can be used to shed some light on the 
quality of different extrapolation schemes. For exam- 
ple, in Refs. [52|, |53| it is indicated how aopt provides 



a measure for the rate of convergence of the extrapola- 
tion. In our case, the values found for OLopt are greater 
than the ideal asymptotic value of 3, indicating the need 
for an overweight of the energy obtained with the largest 
basis set with respect to the other one in the extrap- 
olation formula (O, similarly to that found for CCSD 
calculations 64! . This means that for the considered ba- 
sis sets the rate of change of the correlation energy with 
increasing basis set is larger than it should be asymptoti- 
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d or a 



FIG. 2: Relative weights (Eq. ((Sl) of the energies used in 
Eqs. ([Sjl, ((6]), and for different values of the d and a pa- 
rameters. The dashed lines indicate the location of optimized 
parameters. 



cally, i.e. the basis sets are rather inadequate to describe 
the CBS limit. This is not surprising for RPA calcula- 
tions that are well known to converge very difficultly to 
the CBS limit and nicely explains the fact that for 56- 
extrapolation a slightly larger value of aopt was fond than 
for 67-extrapolation. 

For the other two formulas (Eqs. ^ and ([6])) similar 
considerations apply. In fact, it is possible to see that all 
the optimized parameters provide, in the extrapolation 
procedure, the same relative (over)weight of the energy 
obtained with the largest basis set with respect to the 
other one in all formulas. This is made evident in Fig. [21 
where we plot the ratios 



(n -I- df 



{n + df 



(m + df ' (m + dy 



^ (8) 



for n = 7, m = 6 (top panel), n — 6, m = 5 (lower 
panel), at several values of d or a. The coincidence of 
the relative weights for different formulas provides thus 
a rationale for the fact that all formulas yield practically 
the same results (note that input energies are the same in 
all formulas at the same level of extrapolation) . Inspec- 
tion of the figure also indicates that the relative weights 
obtained from optimized expressions (1.78 and 1.98 for 
67-extrapolation and 56-extrapolation, respectively) are 
not far from the "ideal" ones, obtained considering a = 3 
(1.60 and 1.74, respectively). 



SEMIEMPIRICAL EXTRAPOLATION FROM 
SMALL BASIS SETS 

As discussed in the previous section, for smaller basis 
sets it is not possible to find accurate global parameters 
to extrapolate the RPA correlation energy to the CBS 



limit. This problem is clearly related to the inadequacy 
of the V4Zcv basis set which makes the progression to 
V5Zcv results not regular enough to be extrapolated with 
a global formula. The problem can be however partially 
circumvented by employing a suitable technique specifi- 
caly developed for extrapolation from small basis sets. 

In wave function theory, e.g. for CCSD(T) calcula- 
tions, two main strategies are used to this end: either 
considering effective empirical parameters for the ex- 
trapolation [H, [13, El, [s^] or considering the relatively 
inexpensive MP2/CBS limit plus a ACCSD(T) correc- 
tion evaluated with a smaller basis set [s^l. This lat- 
ter approach is however not straightforward to extend to 
RPA calculations, because it is based on the fact that 
CCSD(T) and MP2 energies display a very similar basis 
set dependence. This similarity is not verified by RPA 
and MP2 energies, which instead have a markedly dif- 
ferent basis-set behavior (e.g. for the Ne atom passing 
from the TZcv to the QZcv basis set the energy changes 
by 60, 32, and 31 mHa for RPA, CCSD(T), and MP2 
respectively; changing from the QZcv to the 5Zcv level 
the change is 25, 11, and 12 mHa for RPA, CCSD(T), 
and MP2 respectively). 

Therefore, in this paper we follow the first strategy 
and, following a similar approach as that proposed by 
Bakowies 5^ |53| for MP2 and CCSD calculations, we 
introduce atom-dependent parameters which account for 
the irregular behavior in the progression of V4Zcv-V5Zcv 
energies. The CBS energy is defined as 



EmrrC 



rf' — rrp 



where n — 5, m — 4, and 
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(9) 



(10) 



with the index i running over all atoms in the system, N 
the total number of atoms, and rii the number of elec- 
trons of the «-th atom. The atomic parameters 7i are 
fixed by fitting Eq. ^ for single atoms and a minimal set 
of homodimers (II2, N2, 02,F2) except for carbon where 
C2H2 was considered. Considering only atomic data is 
in fact not sufficient to yield accurate and balanced re- 
sults for all the systems (the mean absolute error and 
the mean relative errors are twice as big as in the case 
of the minimal trainig set and the maximum deviation is 
4.7 mHa for HOOH), while a large training set would be 
likely to lead to artifacts in the fitting. The minimal set 
considered here provides instead a good accuracy limit- 
ing as much as possible an arbitrary selection of reference 
systems. 

The optimized parameters and the resulting RPA cor- 
relation energies are reported in Tab. IIIII 

The method performs indeed rather well, yielding a 
mean absolute error of only 1 mHa, in line with the es- 
timated accuracy of the reference set, a mean absolute 
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TABLE III: Atomic parameters (7^), extrapolated RPA cor- 
relation energies (E^^^), differences with respect to reference 
energies (AE^^-^), and ratio between AE^^^ and the expected 
accuracy of each reference energy (5), for the systems of the 
test set of Tab III The last lines report the mean error (ME), 
the mean absolute error (MAE), the mean absolute relative 
error (MARE), the maximum absolute deviation (MAD) from 
reference, and the standard deviation the extrapolated eener- 



gies. All results 


arc in 


mHa. 






System 










H 


3.10 


-20.9 


0.1 


1.3 


c 


3.25 


-292.2 


0.6 


0.9 


N 


3.35 


-337.7 


-0.5 


-0.8 





3.23 


-430.9 


1.0 


1.1 


F 


3.15 


-523.3 


1.0 


0.7 


Ne 


3.28 


-602.2 


0.0 


0.0 


Ha 




-81.2 


0.0 


-0.1 


NH 




-410.4 


0.1 


0.1 


NH2 




-480.5 


0.2 


0.2 


CH4 




-502 


-0.5 


-0.8 


NH3 




-547.2 


-2.5 


-3.6 


H2O 




-583.3 


-1.3 


-1.5 


FH 




-603.4 


-1.1 


-1.0 


C2H2 




-774.9 


0.1 


0.1 


CN 




-793.4 


-1.0 


-1.0 


HCN 




-819.8 


-0.9 


-0.9 


CO 




-844.8 


-1.8 


-1.5 


N2 




-855.2 


0.2 


0.1 


C2H4 




-852.5 


0.1 


0.1 


HCO 




-890 


-1.4 


-1.2 


H2CO 




-934.7 


-1.7 


-1.4 


O2 




-1003.1 


-1.3 


-0.9 


H3COH 




-1010.7 


-1.7 


-1.3 


HOOH 




-1110.1 


-2.1 


-1.3 


F2 




-1165.1 


-1.5 


-0.6 


ME 






-0.7 




MAE 






0.9 




MARE 






0.16% 




MAD 






2.6 




Std. Dev. 






1.0 





relative error of 0.16%. Only in nine cases over twenty 
five the extrapolation errors exceed the intrinsic accuracy 
of the reference energies (i.e. lAE^'^f /5 > 1), while in aU 
cases the extrapolated energies improve over the raw data 
obtained at the V5Zcv level and even with respect to the 
raw V7Zcv data. 

The present semiempirical extrapolation scheme can 
thus be a valuable tool for RPA calculations on large 
systems, as it allows to achieve a good accuracy at a rel- 



TABLE IV: Statistics for the atomization energies computed 
with different approaches with respect to the best estimated 
RPA results (see Tab. [J): ME, mean error; MAE, mean abso- 
lute error; MARE, mean absolute relative error; MAD, max- 
imum absolute deviation; Std. Dev., standard deviation. All 
results are in mHa. For two-point extrapolation schemes the 
results are reported only once, since all the procedures are 
essentially superimposable for atomization energies. 



Method 


ME 


MAE 


MARE 


MAD 


Std. Dev. 


V4Zcv 


-1.5 


1.7 


1.27% 


5.0 


1.6 


V5Zcv 


0.2 


1.2 


0.93% 


3.0 


1.4 


V6Zcv 


0.9 


1.3 


1.05% 


2.6 


1.4 


V7Zcv 


0.9 


1.2 


0.95% 


2.3 


1.1 


56-extr. 


1.5 


1.8 


1.38% 


3.4 


1.5 


67-extr. 


1.0 


1.1 


0.84% 


2.0 


0.8 


semiemp. 


1.0 


1.0 


0.75% 


2.2 


0.6 



atively small computational cost. We recall in fact that, 
since RPA-correlation calculations scale as 0{N^) (which 
can be reduced to 0{N'^ log using the resolution of the 
identity (RI) technique [17[), where TV is the number of 
basis functions, and a cc-pVnZ basis set contains approx- 
imately 

{n+l){n+l)in + 2) 

3 ^ ' 

basis functions ^80J , the computational time required 
by the semiempirical approach is about 204^/91^ ~127 
(30 when RI is used) times faster, i.e. two orders 
of magnitude less, than a V7Zcv calculation and still 
140^/91^ '^13 (6 when RI is used) times faster, i.e. one 
order of magnitude less, than a V6Zcv calculation. 

ATOMIZATION CORRELATION ENERGIES 

To conclude our work we provide in this section a short 
discussion on RPA correlation atomization energies, that 
are defined as energy differences between a molecule and 
its constituent atoms. These can be easily constructed 
from the data of Tabs. HlllTl and lllll or alternatively, for 
methods using a two-point extrapolation formula with 
global (i.e. system independent) parameters, they can be 
obtained by applying the extrapolation formula directly 
to the raw atomization energies (i.e computed with two 
given basis sets). The statistics of atomization energies, 
using as reference data the atomization energies obtained 
from the best estimated RPA energies of Tab. U are 
reported in Tab. IIVI 

The inspection of Tab. IIVI reveals that several ap- 
proaches are capable to yield very good results, with 
mean absolute errors close to 1 mHa and mean rela- 
tive errors smaller than 1%. In particular, all the cal- 
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culations using raw basis-set results (except V4Zcv) per- 
form remarkably well, showing a significantly better per- 
formance with respect to the case of absolute energies, 
thanks to the cancellation of systematic errors in the 
present case. Interestingly little or no benefit comes 
instead from the use of extrapolation techniques. In 
fact, 67-extrapolation results are in line with V7Zcv re- 
sults, while 56-extrapolation atomization energies are 
even worst that V5Zcv ones and of similar quality as the 
V4Zcv results. This result shall be rationalized in terms 
of an inevitable small but random computational noise 
that affects the extrapolation data (especially if small 
basis sets are used as a base for the extrapolation). The 
small inaccuracies of extrapolated data can be in fact al- 
most irrelevant for absolute energies (below 1 mHa) , but 
can easily sum to several mHa in the case of atomization 
energies (especially for many atoms molecules). Finally, 
we remark the good performance of the semiempirical 
extrapolation method, which yields atomization energies 
in good agreement with the reference ones, with a mean 
absolute relative error of only 0.75% and very small max- 
imum absolute deviation and standard deviation. 

The results of Tab. llVlshow that the quality of each ex- 
trapolation procedure to yield accurate atomization ener- 
gies cannot be directly inferred from the results obtained 
for absolute energies, because atomization energies are 
energy differences and thus a complex error propagation 
can occur. As we showed this effect is more important 
for low-level approaches and is somehow amplified for 
extrapolated results, because of the presence of unsys- 
tematic errors due to the numerical procedure. It is thus 
finally important to note that this issue applies also for 
our reference atomization energies, that were obtained 
from the best estimated RPA absolute energies of Tab. 
m although in this case we may expect the effect to be 
rather limited due to the high quality of the extrapola- 
tion. Nevertheless, we have to remark that possibly er- 
rors of few mHa are plausible for some systems, so that in 
general the set cannot be fully considered as an accurate 
benchmark set of atomization energies, although it could 
be safely considered as reference for a qualitative discus- 
sion of the trends obtained from different extrapolation 
methods. 



CONCLUSIONS 

The slow basis set convergence of the RPA correla- 
tion energy is a major problem for the development and 
the application of the method in quantum chemistry and 
solid-state physics. In fact, a careful assessment of the 
RPA methodology and related methods (e.g., RPA-|- 14| . 
RPA-I-SE pj|), requires the existence of a set of bench- 
mark absolute and relative RPA-correlation energies as 
well as the availability of accurate extrapolation tech- 
niques. Practical applications, on the other hand, would 



greatly benefit from efficient extrapolation schemes with 
well calibrated levels of confidence. 

Surprisingly however only little effort has been dedi- 
cated in literature to study the CBS limit of RPA cor- 
relation energies and assess the effectiveness of different 
extrapolation procedures in this context. In this paper 
we aimed at pushing this work one step further and con- 
sidered the problem in a more systematic way, construct- 
ing a benchmark set of reference RPA-correlation ener- 
gies and studying the behavior of different extrapolation 
schemes in the RPA framework. Our work can be sum- 
marized in the following conclusions: 

• The extremely slow convergence of the RPA corre- 
lation energy with the basis set dimension prevents 
the computation of highly accurate reference ener- 
gies. Already at the V7Zcv level in fact the energies 
can be incorrect up to 10 mHa. Nevertheless, we 
showed that a careful extrapolation procedure al- 
lows to construct an accurate benchmark set with 
a well defined estimation of the uncertainty. In the 
present work we estimate our reference data to have 
an accuracy close to 1 mHa for most of the systems. 

• We analyzed several two-point extrapolation for- 
mulas against our benchmark set of absolute RPA 
correlation energies. It turns out that good results 
can only be achieved if global parameters are opti- 
mized in Eqs. ([5]), ([6]), and ([7]), as to effectively re- 
produce high-order terms in the theoretical asymp- 
totic energy expansion (oc l/'n?). When this proce- 
dure is followed all the considered formulas provide 
equivalent results. Moreover, with a careful opti- 
mization of the global parameters very good results 
can also be achieved from 56-extrapolation proce- 
dures. 

• RPA correlation energies evaluated at the V4Zcv 
level appear to be of too low quality to serve as a 
basis in an extrapolation procedure with globally 
optimized parameters. This fact indicates that in 
the case of 45-extrapolations high-order terms be- 
yond the one, play a crucial role and are thus 
dominant in the asymptotic expansion. Therefore, 
RPA energies obtained at the quadruple-zeta level 
of theory cannot be considered well converged with 
respect to the one-particle expansion in basis set 
and shall not be used in applications. Nevertheless, 
the introduction of atomic-based parameters within 
a semiempirical extrapolation scheme can strongly 
reduce the problem and provide extrapolated RPA 
absolute energies of good quality (within 2 mHa 
from the reference). The semiempirical extrapola- 
tion method can thus be considered as a good tool 
for the CBS extrapolation in RPA applications on 
large systems. 
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• When energy differences are considered, an un- 
predictable error propagation occurs and accurate 
CBS results are even more hard to achieve than 
the absolute correlation energies. However, because 
an error cancellation occurs for the systematic er- 
rors (e.g. systematic underestimation of the en- 
ergy by incomplete basis sets) we finally found that 
most approaches yield results that agree within 2 
mHa. Because no highly accurate reference data 
exist to assess such small differences, no clear pref- 
erence can be expressed for any of the considered 
approaches and 2 mHa 1.2 kcal/mol, ~ 0.05 
meV) must be considered the level of confidence 
of state-of-the-art RPA calculations on atomization 
energies. Therefore, caution must be always em- 
ployed when small energy difference are considered. 

We mention finally that, following the encouraging re- 
sults of the present study, further investigations will be 
needed to assess in detail the basis set-dependence of 
RPA energies and the accuracy of extrapolation schemes 
for energy differences (e.g. atomization energies), which 
are a main quantity in many practical computational 
studies. In fact, when energy differences are considered, 
an unpredictable error propagation may occur and ac- 
curate CBS results are harder to achieve than the abso- 
lute correlation energies, despite the former can be di- 
rectly derived as an algebraic sum of the latter. From 
our calculations it can be easily seen indeed that all the 
approaches considered in this work yield very similar at- 
omization energies for the molecules of the test set, with 
differences mostly below 2-3 mHa. However, no clear 
trend can be identified between the different approaches, 
so that no highly accurate reference data can be estab- 
lished to assciss such small differences. This traces back 
to the fact that systematic errors (e.g. systematic under- 
estimation of the energy by incomplete basis sets) may 
be expected to cancel out in this case, but extrapolation 
procedures may introduce in general unsystematic errors 
which worsen the results with respect to the raw data 
obtained from the corresponding basis sets. 
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